THE EXPECTED GENUS OF A RANDOM CHORD DIAGRAM 



NATHAN LINIAL AND TAHL NOWIK 

Abstract. To any generic curve in an oriented surface there corresponds an oriented chord 
diagram, and any oriented chord diagram may be realized by a curve in some oriented 
surface. The genus of an oriented chord diagram is the minimal genus of an oriented surface 
in which it may be realized. Let g n denote the expected genus of a randomly chosen oriented 
chord diagram of order n. We show that g n satisfies: 

TL 

9n = - - 6 (Inn). 



1. Introduction 

The study of plane curves dates back to C.F. Gauss. Gauss in [5] has attached to any 
plane curve with n double points a 2n letter word, as follows. To each double point attach a 
letter, and then register the letters you encounter as you travel along the curve. One obtains 
a word of length 2n, where each of the n letters appears precisely twice. Such a word is 
called a Gauss word. Clearly not any Gauss word may be realized by a plane curve, and 
Gauss has pointed out a necessary condition for it to be realizable. One can enhance the 
Gauss word of a curve with a mark on one of the two occurrences of each letter, signifying 
that the corresponding strand of the curve crosses the curve at the given double point from 
right to left. Various necessary and sufficient conditions for a Gauss word to be realizable 
in the plane have been eventually given, both in the marked and unmarked settings. See 
[3], [7], [10], [11], [12] and references therein. 

To avoid the arbitrariness of the assignment of letters to the different double points, 
one can replace Gauss words with chord diagrams. To a marked Gauss word corresponds an 
oriented chord diagram, which is by definition a division of the set {1, . . . ,2n} into n ordered 
pairs. This may be represented as a circle with 2n designated points, and n oriented chords 
connecting pairs of these points. 
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Though not every oriented chord diagram may be realized by a curve in the plane, it 
may be realized by a curve in some oriented surface. J.S. Carter in [2] has given a direct 
construction for the minimal genus surface in which a given diagram may be realized, as 
follows. Take an annulus, which is thought of as a regular neighborhood of the curve, and 
identify n pairs of regions along the annulus according to the prescription of the diagram. 
One obtains an orientable surface F with some d boundary components. Capping off the 
boundary components with discs produces the required minimal genus surface. The genus of 
this surface is g — |(n + 2 — d) and so < g < |(n + 1). We will refer to this minimal genus 
as the genus of the given diagram. A diagram being realizable in the plane is equivalent to 
it having genus 0. 

For fixed n, we are interested in the distribution of genera of the oriented chord diagrams 
with n chords, and we ask: What is the expected genus g n of a randomly chosen diagram? 
We show that the expected genus is very close to the maximal possible genus |(n + 1), in 
fact, we show that 

9n = ^ - O(mn). 

Since g — |(n + 2 — d), this is equivalent to showing that the expected number of boundary 
components is O(lnn). 

Though the above description is geometric, counting the number of boundary components 
may be described in a purely combinatorial manner. Traveling along a boundary component 
of our surface corresponds to a walk on the chord diagram according to the following rule. 
When moving along the circle of the diagram, and arriving at an end of a chord, continue your 
motion along the chord to its other side. If your motion along the chord is in the direction 
(respectively, against the direction) of its orientation, then continue your motion along the 
circle in the same (respectively, opposite) direction as you have moved before entering the 
chord. So, to identify a boundary component, one travels along the diagram according to 
the above rule, until returning to the starting point. Repeating this process, one obtains all 
boundary components. 

This combinatorial walk along a chord diagram is reminiscent of the walk along the cycles 
of a permutation. The distribution of the cycles of a random permutation on n letters is 
well understood, and the expected number of cycles is also 6 (Inn). Indeed, it is for similar 
reasons that the expected number of cycles in a permutation and the expected number of 
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cycles of the walk along a chord diagram are both 6 (Inn), though as will be seen, the setting 
of chord diagrams is substantially more complicated. 

Since this problem may be formulated both in a topological and a purely combinatorial 
manner, it may be of interest to both topologists and combinatorialists, and indeed the text 
is aimed for both audiences. This work may be viewed as part of the recent expanding 
interest in probabilistic questions in topology, as appears in [1],[4],[6],[8],[9]. 

2. Definitions and statement of result 

Let F be an oriented surface. A generic curve in F is an immersion c : S* 1 — > F for which 
the only self intersections are transverse double points. We fix n once and for all, and call a 
point of S 1 a dot if it is one of the 2n points {e 7n/C//n : 1 < k < 2n}. If a generic curve c 
has n double points, then there are 2n points in S* 1 mapped into them, and we will always 
assume that these 2n points are precisely our 2n dots. A generic curve with such n double 
points will be called an n-curve. 

An oriented chord diagram of order n is a division of the set of dots into n disjoint ordered 
pairs. One can represent an oriented chord diagram in the plane, by drawing an oriented 
chord connecting each ordered pair, where the orientation of the chord represents the order 
of the pair. An oriented chord diagram will also be called simply a diagram. 

We denote the set of all diagrams of order n by V n , and we have \V n \ = Any 
n-curve determines a diagram D(c) G V n as follows. The double points of c divide the 2n 
dots into pairs, and the orientation of the surface F induces an ordering on each pair, in the 
following way. If c(a) = c{b) for dots a, b, and c'(6),c'(a) is a positive basis with respect to 
the orientation of the surface, then the ordered pair (a, b) is taken. See Figure 1. 

Any diagram may be realized by a curve on some oriented surface, and a regular neighbor- 
hood of the curve in all such surfaces is the same, as we now explain (compare [2]). Given a 
diagram D e T> n , take the annulus A = S 1 x [— e, e] with a fixed orientation, and identify S 1 
with S* 1 x {0}. For each dot a let S a C A be the 2e x 2e square centered at (a, 0). We now 
identify pairs of dots a, b according the prescription of D, and we identify the corresponding 
squares S a , Sb with a positive or negative | rotation, so that the self intersection of the curve 
at the identified square will be as prescribed by the orientation of the chord between a and b 
in D. We obtain an oriented surface with d(D) > 1 boundary components, which we denote 
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Figure 1. Chord (a, b). 

F(D). Clearly, the embedding S 1 — > A given by z i— > (z, 0), composed with the quotient 
map A — > F(D), is an n-curve S" 1 — > -F(-D) which realizes D, and a regular neighborhood of 
any curve in any surface realizing D, is identical to F(D). It follows that the genus of the 
closed surface obtained by capping off the d(D) boundary components of F(D) with discs, 
is the minimal genus of a surface in which D may be realized. 

Definition 2.1. Given a diagram D, we define g{D), the genus of D, to be the minimal 
genus of a closed oriented surface admitting a curve c : S 1 — > F with Z?(c) = D. 

The image in F(D) of our n-curve is a graph with n vertices and In edges, and so 2 — 
2g(D) = n - In + d(D), or, = |(n + 2 - d(D)). Since > 1, we deduce that 

0<g(D)<l(n + l). 

In this work we study the following question: What is the expected genus of a randomly 
chosen diagram D G T> n l We will show that the expected genus is close to the maximal 
possible genus \{n+ 1). More precisely we will show: 

Theorem 2.2. The expected genus g n of a random oriented chord diagram D e V n satisfies: 

n 

9n = - - O(mn). 

We think of g(D) and d(D) as random variables defined on a randomly chosen D e I^n- 
That is, our sample space is V n , each diagram having equal probability ^yy- We denote the 
expected values by g n = E[g] and d n = E[d]. We will show that d n = O(lnn), from which 
Theorem 2.2 follows via g n — |(n + 2 — d n ). 
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Figure 2. Dots, squares, edges, and their labels. 
3. The random procedure 



We label the dot e mk l n ) and its corresponding square, simply by k, and so k± 1 will mean 
addition mod In. The interval along the boundary of the annulus A between two adjacent 
squares will be called an edge, so we have 4n edges. We orient the edges according to the 
orientation induced on dA from that of A, and we denote the oriented edge from square a 
to square b by [a, b]. The edge [b, a] will then be the parallel edge in the other boundary 
component. So, all edges in S 1 x {e} are of the form [a, a+ 1], and they will be called positive 
edges, and all edges in S 1 x {— e} are of the form [a+ 1, a] and will be called negative edges. 
See Figure 2. 

We would like to see how our edges are attached to each other due to the gluing of two 
squares. We will say an attachment [a, b] — [c, d] takes place if the end point of [a, b] is glued 
to the beginning point of [c, d]. So, say we have identified the two squares a and b according 
to the oriented chord (a, 6). As seen in Figure 3, the eight edges involved are attached to 
each other as follows: [a — 1, a] — [b, b + 1], [b + 1, b] — [a, a + 1], [a + 1, a] — [b,b — 1], 
[b — 1, b] — [a, a — 1]. This can be summarized by the following rule: 

Rule 3.1. An attachment [a, b] — [c,d] holds if the oriented chord (6, c) exists, and the signs 
of [a, b] and [c, d] are the same, or if the oriented chord (c, b) exists and the signs of [a, b] and 
[c, d] are opposite. 

Traveling along a boundary component of the surface F(D) corresponds to a walk along 
the diagram D, which by Rule 3.1 proceeds as follows: When moving along the circle of the 
diagram, and arriving at an end of a chord, continue your motion along the chord to its other 
side. If your motion along the chord is in the direction (respectively, against the direction) of 
its orientation, then continue your motion along the circle in the same (respectively, opposite) 
direction as you have moved before entering the chord. 
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Figure 3. Edge attachments due to chord (a, b). 

We may thus read all boundary components directly from the diagram D, as follows: 
Choose an arbitrary edge, and start traveling along the diagram in the above way, alter- 
natingly passing edges and chords, until you return to your initial edge. Then choose some 
unvisited edge, and similarly travel until you return to it. Continue until all edges have been 
visited. Notice that when this is done, each chord of the diagram has been visited four times, 
each visit corresponding to one of the four corners of the glued square. 

For k < n, a £;-n-diagram is a choice of 2k out of the 2n dots, and a division of these 2k 
dots into k oriented chords, i.e. ordered pairs. The remaining 2n — 2k dots will be called 
vacant dots. So, an oriented chord diagram of order n is an n-n-diagram. 
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Definition 3.2. A path in a &-n-diagram D, is a sequence [cti, 61] — [a 2 , b 2 ] — ■ ■ ■ — [a r , b r ] 
of distinct edges, attached via oriented chords of D according to Rule 3.1. A path in D is 
called a loop if the attachment [a r , b r ] — [01, bi] also holds. A path in D is called a segment 
if a\ and 6 r are vacant dots (perhaps the same dot). Loops and segments are precisely those 
paths that cannot be further extended. 

We now specify our procedure for choosing a random n-n-diagram. Our procedure will 
choose the chords one by one. We first fix an ordering e±, . . . , e 4 „ of our edges, once and for 
all. Before the procedure begins, we announce e\ as the "pointer" edge. Assume that after 
the (j — l)th step, we have already chosen j — 1 oriented chords, and the pointer edge lies in 
a segment (rather than a loop) of the given (j — l)-n-diagram. The next chord is now chosen 
with one of its dots being the concluding dot of the segment in which the pointer lies, and 
its other dot is randomly chosen from the other In — 2j + 1 vacant dots. The orientation 
of the new chord is also randomly chosen. If in the new j-n-diagram, the pointer's segment 
continues to be a segment, i.e. it does not close into a loop, then the same edge remains 
the pointer. If on the other hand, after the jth chord is added, the pointer's segment closes 
into a loop, then the edge with smallest index which lies in a segment in the new j-n- 
diagram, becomes the new pointer. This procedure indeed produces all n-n-diagrams with 
equal probability. 

Examples of two runs of our random procedure appear in Figures 4 and 5, demonstrating 
some of the interesting features of the possible evolution of the pointer's segment. In both 
figures the edge e\ is the edge [a, a + 1] and is marked by 1. 

In Figure 4 the chords are chosen in the following order: (a + 1, b), (c, b + 1), (c — 1, c + 
1), (a, a + 2). The segment of e± after the four steps of this run is [a + 3, a + 2] — [a, a + 1] — 
[b, b + 1] — [c, c — 1] — [c + 1, c] — [b + 1, b] — [a + 1, a + 2] — [a, a — 1] and these edges are 
marked in the figure by 0, . . . , 7, in this order. The evolution of ei's segment throughout the 
four steps of this run is 1-2, 1-2-3, 1-2-3-4-5-6, 0-1-2-3-4-5-6-7. 
Note that after the third step the segment revisits some chords that were chosen in previous 
steps. Note that after the fourth step the segment extends in both directions, and so 1 is no 
longer the first edge in the segment. We point out the following difference between this run, 
and the run of Figure 5. In Figure 4 the initial and final dots of the pointer's segment are 
distinct, whereas in Figure 5 it is the same dot. 
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Figure 4. A four step run of the random procedure. 

4. Upper bound for d n 

In this section we establish our upper bound for d n . If £ is a loop in an n-n-diagram D, 
then the size of £ is the number of distinct chords visited by £. If k is the size of the loop £ 
and r is the number of edges it visits, then since £ alternatingly visits an edge and a chord, 
and since each chord is visited at most four times, we have k < r < 4k. 

For given n, let = Lk(n) denote the expected number of loops of size A; in a random 
n-n-diagram, then d n = Ylk=i ^k- We will show in Proposition 4.5 below, that for k < j^, 
Lk < §• On the other hand, since any chord is visited by at most four different loops, the 
total number of all loops of size k > -r^ is at most 400, and so its expected value y\^_n_ Lk 

1UU - K ^ 100 

is at most 400. Together this gives 



d n 



E 

fc< 



L k + Y L k< Yl ^ + 400 < 3 Inn + 400. 



100 



K> 100 



In order to obtain our bound L/, < | we will need to bound the probability that at a given 
kth step, the pointer's segment closes into a loop. We now prove the following: 



Proposition 4.1. For k < j^, the probability that the pointer's segment will close into a 
loop at the kth step is at most 




Let S be the pointer's segment in our (k — l)-n-diagram after step k — 1, and let p be 
its concluding dot. We now need to choose the kth chord, with one end being p. We must 
determine how many choices will result in closing S into a loop. Let q be the initial dot of S. 
If q ^ p (as occurs in the example in Figure 4, where q = a + 3 and p — a — 1), then for S to 
be closed into a loop, we must choose q as the second dot for the fcth chord, and so the choice 
of unoriented chord is unique. Though usually only one of the two choices of orientation for 
this chord will indeed close the segment S into a loop (as is the case in Figure 4), it may 
in fact occur that both orientations accomplish this. Since we are seeking an upper bound 
for the probability, we will always count both orientations as possible, or in other words we 
will ignore the choice of orientation in the computation of the probability. Since there are 
2n — 2k + 1 vacant dots from which we may choose the second dot for the new chord, the 
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probability of choosing the correct dot q is 2w -2fc+i ' anc ^ s i nce we assume k < we have 
i < hi 

2n-2fc+l — In ' 

If q — p (as in the example of Figure 5, where q — p — a), then at first sight it may seem 
that closing the pointer's segment into a loop is impossible, and in most cases this is in fact 
true. But on the other hand, there are cases with q = p where not only does there exist a 
choice of chord that closes the given segment into a loop, but there are in fact many such 
choices. The example in Figure 5 is such case. The oriented chords (a, a + 4), (a, a + 7) or 
(a, a + 11) may each be added in the present step to close the pointer's segment into a loop. 
For the analysis of this phenomenon, we define the following notion. 

Definition 4.2. A plug is a segment [a±, bi] — • • • — [a r , b r ] for which a\ = b r . The vacant 
dot d\ = b r is called the entrance to the plug. 

Examples of two plugs are depicted in Figure 6. In Figure 6a, the chord (a — l,a + 1) 
produces the plug [a, a — 1] — [a + 1, a] with entrance a. In Figure 6b, the chords (a + 1, a + 3) 
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and (a + 4, a + 2) produce the plug [a, a + 1] — [a + 3, a + 4] — [a + 2, a + 3] — [a + 1, a] with 
entrance a. Note that the same vacant dot can be the entrance to two different plugs. 

In our present case, where q — p, the pointer's segment is itself a plug, but this fact is not 
of interest to us. What enables us to close the pointer's segment into a loop in Figure 5, is 
each one of the additional plugs that are present in the given 5-n-diagram, namely the three 
plugs [a+A,a+3]-[a+5,a+A], [a+7, a+6] - [a+8, a+7], and [a+ 11, a+ 10] - [a + 12, a+ 11] 
(each of which is similar to the plug in Figure 6a). In fact, the next lemma shows that in 
order to close the pointer's segment into a loop in the case q = p, it is necessary that the 
second dot of the new chord will be an entrance to a plug. 

Lemma 4.3. Let D be a (j — 1) -n- diagram, and let a be a vacant dot in D. Let e be an edge 
entering a, (i.e. e is [a — 1, a] or [a + 1, a]). Assume Q is an additional oriented chord with 
one end at a and the other end at some other vacant dot b, such that in the j-n diagram 
obtained by adding Q, the path beginning at e leads to an edge e' which is one of the two 
exiting edges at a (i.e. e' is [a, a + 1] or [a, a — 1]). Then b is the entrance to a plug in D. 

Proof. Assume b is not an entrance to a plug. In Figure 7, edge e is marked, and the two 
possibilities for e' are marked. The path beginning at e passes the point x, and in order for 
it to lead to e', it must eventually arrive back into the region depicted in the figure. It does 
not arrive at y\ or y 3 since we have assumed that b is not an entrance to a plug. If it arrives 
at t/4 then it closes a loop without passing either possibility for e'. So, it must arrive at y-2 
as depicted, and it then exits our region through point z. This time its only possibility for 
returning is through t/ 4 , which as before prevents it from ever arriving at either possibility 
for the exiting edge e'. □ 

As we have seen, there may be many plugs available for completing our segment into a 
loop, but fortunately, the expected number of available plugs is small. The main technical 
effort of this work is the following proposition whose proof we defer to Section 6. 

Proposition 4.4. For k < the expected number of plugs present after k steps of the 
random procedure is at most \. 

Back to the proof of Proposition 4.1 for the case q = p. By Lemma 4.3, in order for S to 
close into a loop, the second dot we choose for the new chord must be the entrance to some 
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Figure 7. Proof of Lemma 4.3. 



plug. By Proposition 4.4, the expected number of plugs existing in the present stage of the 
random procedure (i.e. after k — 1 steps), is at most |. Together with the case q ^ p we have 
on average at most 1 + | choices for the new unoriented chord. (As before, we ignore the 
additional choice of orientation.) Note that we must take the sum and not the maximum of 
the bounds for the two possibilities q ^ p and q = p, since the expectation for the number 
of plugs that we bound in Proposition 4.4 is not conditional on q = p. 

We obtain that for k < the probability that the pointer's segment closes into a loop 
at the kth step of the random procedure is at most (1 + \) • |^ < which completes the 
proof of Proposition 4.1. 

Proposition 4.5. Let Lk denote the expected number of loops of size k in a random n-n- 
diagram. Then for k < we have Lk < § . 

Proof. It is clear from the definition of our random procedure, that if the segment of e\ 
closes into a loop at the A;th step, then this loop is of size k. So the event that the edge t\ 
lies in a loop of size k is the same as the event that e\ survives as pointer until step k, and 
then at step k its segment closes into a loop. The probability for this event is at most the 
probability that at the kth step the pointer's segment closes into a loop, and by Proposition 
4.1 this probability is at most Now, our random procedure produces each n-n-diagram 
with equal probability, and so by the symmetry of our annulus, the probability for any given 
edge to lie in a loop of size k is also at most j-, or alternatively, the probability that a 
randomly chosen edge will lie in a loop of size k is at most j-. 
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We obtain a lower bound for P k by noting that each loop of size k includes at least k 
edges, and the total number of edges is An, and so P k > Together we get ^ < P k < 
which proves our claim. □ 

As already explained above, the bound L k < § for k < implies the following upper 
bound for d n : 

d n < 3 Inn + 400. 



5. Lower bound for d n 

We have asked in the proof of Proposition 4.5, what is the probability P k that a randomly 
chosen edge will lie in a loop of size k. We have noticed that this is precisely the probability 
that in our random procedure, the segment of t\ survives until the A;th step, and then at the 
A;th step it closes into a loop. In this section we will find a lower bound for P k , for n > 50 
and k < \/n, which in turn will provide a lower bound for d n . 

A run of j — 1 steps of the random procedure is called good if after these j — 1 steps e\ 
is still the pointer, and its segment is of the form \a\, b±] — ■ ■ ■ — [aj, bj] with all the dots 
ai, bi, . . . , cij, bj being distinct. When j — 1 = 0, i.e. before starting the random procedure, 
then the pointer's segment is simply e\ = [ai, b±\, so the run is good. If the run is good after 
j — 1 steps, and at the jth step the second dot chosen for the new chord is not adjacent to 
any of the dots a\, bi, . . . , aj, bj, then the run is still good after the jth step. This restriction 
for the choice of the jth chord means that if an edge of the segment is say [a, a + 1], then 
the four dots a — l,a, a+l,a + 2 are not chosen. So, at the jth step we have at most Aj dots 
which we are forbidden to choose, so the number of allowed choices for a new dot at the jth 
step is at least 2n — Aj. Since the total number of dots from which we choose is 2n — 2j + 1, 
the probability for such restricted choice at the jth step is at least 2 ^"~ j 4 ^ 1 . 

If after k — 1 steps of the random procedure the run is still good, then in particular, the 
initial and final dots of ei's segment are distinct. So, at the kth step there exists a choice 
of oriented chord that closes ei's segment into a loop, and the probability for this choice 
is 2r2n-2fc+ii • S° together, for n > 50 and k < y/n, the probability that the segment of e\ 
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survives until the kth step, and then at the kth step closes into a loop satisfies: 

P t >_^ -n 2 "- 4i >^f[(i- 23+1 )>±.T\g-i±L 

2 (2n-2k + l 1J -2n-2j + l ~ 4n 1J - V 2n - 2? + r ~ An 11V n - jfc 

7 = 1 J j=l _ 7 = 1 



k— 1 

1 t— I- 6 3 + 1 1 (i 3+1 1 a fc 2 + fc 1 3 n+y7T X 
^> IIC 5 n ~ fc = 6 5 ^J = l n — fc ^> g 5 n — fc ^> g 5 n— ^> 

- 4 n 1 1 4 n - 4 n ~ 4n ~ 9n ' 

j'=i 

(We use the assumption n > 50 in the fourth and last inequalities.) 

As before, let L k be the expected number of loops of size A; in a random n-n-diagram, then 
since the number of edges in a loop of size k is at most 4k we have P k < Together, for 

n > 50 and k < y/n we get ^ < Pjt < so L fc > ^. We may now establish our lower 

bound for d n , for n > 50: 

^11 1 
^ = ^L fc >^->-lnv^=-lnn. 

k=l k=l 

Together with the upper bound of Section 4 we obtain d n = 0(ln n), which proves Theorem 
2.2, stating that the expected genus g n of a random diagram of order n satisfies: 

9n = | - 6(lnn). 

6. Upper bound for the expected number of plugs 

In this section we prove Proposition 4.4, stating that for k < ^j, the expected number of 
plugs present in our £;-n-diagram after k steps of the random procedure is at most \. 

Definition 6.1. Two vacant dots in a /c-n-diagram D are called neighbors, if they are the 
two end points of a segment in D. 

Definition 6.2. A positive plug is a plug [ai,&i] — ••• — [a r ,6 r ] for which the two edges 
[ai,&i], [o r ,6 r ] are of the same sign, that is, they are of the form [a, a + 1], [a — 1, a] or 
[a, a — 1], [a + 1, a]), (as in Figure 6a). A negative plug is a plug for which these two edges 
are of opposite sign, that is, they are of the form [a, a + 1], [a + 1, a] or [a, a — 1], [o — 1, a], 
(as in Figure 6b). Note that if same vacant dot is the entrance to two different plugs, then 
these two plugs must be of the same sign. 

Lemma 6.3. Under the assumptions of Lemma 4-3, if e and e' are of opposite sign, and ifb 
is not the entrance to a positive plug (and so by Lemma 4-3 it is the entrance to one or two 
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FIGURE 8. Proof of Lemma 6.3. 

negative plugs), then either a and b are neighbors (Definition 6.1), or a is also an entrance 
to a plug. 

Proof. Assume a and b are not neighbors. In order for us to arrive at e', given that b is not 
the entrance to a positive plug and a and b are not neighbors, our path must be as in Figure 
8, which shows that a is the entrance to a (negative) plug. □ 

Any chord is involved in at most four different segments, and so at each step, when adding 
a new chord, at most four new plugs can be created. But we will show that in fact the 
expected number of plugs created at each step k < is at most ^. This implies that the 
expected number of plugs present after k < steps is at most \. To establish this bound 
we will in fact need to prove the following more detailed proposition, which distinguishes 
between positive and negative plugs. 

Proposition 6.4. The following holds for k < j^: 

(1) Let G^ (respectively GZ) denote the expected number of positive (respectively negative) 
plugs completed at the kth step. Then G~l < | and GZ < ^. 

(2) The expected number of plugs present after k steps is at most |. 

(3) Let (respectively HZ ) denote the probability that after the kth step the concluding 
dot of the pointer's segment is an entrance to a positive (respectively negative) plug. 
Then H+ < $ and HZ < ^. 

K — n k — n 
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Proof. (1) Say at the kth step we have chosen a chord Q between dots a and b, and a plug 
has been completed, with dot c being its entrance. This means that after adding Q there 
is a segment with edges e ix — e i2 — • • • — e ir beginning and ending at the vacant dot c, and 
before adding Q this segment did not exist. This means that before adding Q, the segment 
Si beginning with e ix ended at some vacant dot a ^ c, and the segment S2 ending with ei r 
began at some vacant dot a' 7^ c. We now distinguish three cases as follows. If a ^ a! then 
the new chord Q must be between a and a'. By definition of our random procedure, the 
concluding vacant dot p of the pointer's segment is one of the dots of the new chord Q, so 
must be either a or a'. We will refer to this case as Case A. If on the other hand a = a' then 
the new chord Q must be between a and some other vacant dot b. In this case either p = a 
or p = b, and these two possibilities will be referred to as Case B and Case C, respectively. 

For Case A, we note that there are at most four different segments with one end being p. 
The other end of each such segment is a vacant dot that may be c of the above description. For 
each such c there is a unique second segment with which a configuration Si, S2 as described 
above may arise for a positive plug, and a unique such second segment for a negative plug. 
Our assumption is that a 7^ a' and so for each such configuration there is a unique choice of 
unoriented chord with which such a plug may be created. As discussed in Section 4, it may 
be that both choices of orientation for this chord bring to the completion of the plug. So here 
and in all following cases, we do as we have done in Section 4, and include both choices in our 
count by simply ignoring the choice of orientation. As before, the probability for the correct 
unoriented chord to be chosen in each case is 7, — hrrr since there are 2n — 2k + 1 additional 

2n— 2fc+l 

vacant dots, and for k < we have 2n -2k+i — ^ ne contribution of this case to 

and is at most 4 • Note that it may be that different configurations in our count 
are completed into a plug by the same choice of chord, but by the additivity of expectation, 
the contributions of all configurations may be added regardless of the dependence between 
them. 

In Case B, Q is between the dot p = a and the dot b, and by Lemma 4.3, b must be 
an entrance to an existing plug. We bound all possible contributions that may be from 
choosing the second dot of the new chord as the entrance to an existing plug. Any new 
chord may participate in at most four different segments, and so at most four new plugs 
may be completed. By induction, we may use (2) of the present proposition for k — 1 to 
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conclude that on average we have at most \ existing plugs available. So, the contribution is 
on average at most 4 • \ • ^ = h^. We cannot determine how this contribution will divide 
between G k and G k and so we add it to both. 

In Case C, p — b, and our choice of the second dot a for Q is such that a is part of a 
configuration of segments Si, S 2 and dots c, a as described above. The segments Si and S 2 
may or may not pass chords, but there is just one special configuration for Si, S 2 in which 
both Si and S 2 do not pass any chord, namely, the configuration where a and c are adjacent 
dots along the annulus, and Si, S 2 are the two edges connecting them. If the configuration 
is not this special one, then necessarily the dot c is adjacent along the annulus to a dot that 
is the end of one of the k — 1 existing chords. So there are at most 4(k — 1) possibilities for 
such dot. For each such dot c there are two possibilities for a pair of segments Si, S 2 that 
may give rise to a positive plug, and two possibilities for a negative plug. Together this gives 
at most 8A; possible pairs of segments for positive plugs and for negative plugs. Now we note 
that in order for us to be in Case C, our dot p must be at the entrance to an existing plug 
after step k — 1. By induction we may use (3) of the present proposition for k — 1 to conclude 
that the probability for us being in Case C is at most f + And so the contribution of the 
non-special configurations to G k and G k is at most 8fc(-^ + ^7)-^ < 8 • ^ • ^ • < 3 • 

For the special configuration, if a plug is completed then it is necessarily a negative plug, 
so contributes only to G k . If p — b is the entrance to a positive plug, which happens, by 
induction on part (3), with probability at most -, then we take our bound to be simply the 
total number of choices 2n — Ik + 1 for a. There may be a special configuration on each side 
of a, and so the contribution to G k is at most 2 • (2n — 2k + 1) • | • < 24 • 

If p — b is the entrance to a negative plug, which happens by part (3), by induction, with 
probability at most ^, then by Lemma 6.3, we must choose a which is either a neighbor of 
b or the entrance to a plug. The dot b has at most 4 neighbors. For each such neighbor a 
there is at most one special configuration that may be completed into a plug, since it may 
not be on the side of a where the segment from b arrives at a. So, the contribution of this 
case is at most 4 • ^ • The second possibility is that a itself is an entrance to a plug, but 
in Case B above we have already counted all possible contributions from connecting p to a 
dot which is the entrance to an existing plug, and so we need not count this again here. The 
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contribution to G k is thus at most 4 • ^ • ^ < since n > 100 whenever the assumption 
k < j^q is relevant. 

We add all contributions for G^: 

Gl+ <(4 + l + 3).H<£, 

and for G fc : 

Gr < (4 + 1 + 3 + 24 + 1) • — < — . 

(2) In each step j < k on average at most f + ^ plugs are completed, by (1), and so after 
k steps the expected number of plugs is at most + 22) < ^(f + f2) = |. 

(3) If after the fcth step, the final dot of our spanning segment is the entrance to a positive 
plug, then this plug may either be one that has existed previously, or one that has just been 
completed. If it is a plug that has existed previously, then in the (k — l)-n-diagram we had 
before the kth step, there is a unique segment S leading to its entrance (which is not the 
plug itself), and let a denote the vacant dot at the beginning of S. In order for us to end up 
at the entrance to the given plug after adding the kth chord, this chord must include a. As 
before, let p denote the concluding dot of the pointer's segment. If p ^ a then we have one 
choice for such unoriented chord. If p = a then in order for us to continue into the segment 
S, then by Lemma 4.3 the other dot b of the new chord must be the entrance to an existing 
plug. Together we see that in order for us to land at the entrance of an existing plug, we 
must choose the second dot for the new chord either as a dot a as described above, which is 
uniquely determined by a plug, or as a dot which is itself the entrance to a plug. By (2) we 
know that there are on average at most \ previously existing plugs, and so this contributes 
at most 2 • \ • ^ to the probability. 

On the other hand, the probability that after the &th step we have landed at the entrance 
of a positive plug that has just been completed, is at most the probability that such a plug 
has at all been completed at the kth step. By (1) this probability is at most -, since the 
expected number of plugs completed is a bound to the probability that at least one plug has 
been completed. Together we get H£ <2-|-|^ + |<|. In the same way, using G k < 
we get H~ < f □ 

Recall that what we have actually used from Proposition 6.4 is only part (2), which 
bounds the total number of plugs. The need for this more detailed analysis is due to the 
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large contribution of existing positive plugs to the completion of new negative plugs in Case C 
with the special configuration. This required that we separate between positive and negative 
plugs in the inductive proof, with a larger bound for the negative plugs. 
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